
ML-Regression

Linear Regression is used for regression tasks to predict continuous outputs by minimizing the mean squared error. Ridge Regression is a regularized version of linear regression that adds a penalty term to the loss function to prevent overfitting. Logistic Regression is used for binary classification tasks to predict probabilities and classify data points.

Linear Regression

1. Hypothesis

Linear regression models the output y as a linear function of the input features x:

$$h_\theta(x) = \theta^T x = \theta_0 + \theta_1 x_1 + \cdots + \theta_n x_n$$

2. Cost Function

The cost function used is the Mean Squared Error (MSE):

$$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$$

3. Optimization

  • Gradient Descent: $\theta_j := \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$
  • Normal Equation: $\theta = (X^T X)^{-1} X^T y$
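Both optimizers fit in a few lines of NumPy. Below is a minimal sketch, assuming a design matrix `X` whose first column is all ones (the intercept term) and a target vector `y`; the function names are illustrative, not from any particular library:

```python
import numpy as np

def fit_gradient_descent(X, y, alpha=0.1, iters=10_000):
    """Batch gradient descent on the MSE cost J(theta)."""
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(iters):
        residual = X @ theta - y                 # h_theta(x^(i)) - y^(i)
        theta -= alpha * (X.T @ residual) / m    # simultaneous update of all theta_j
    return theta

def fit_normal_equation(X, y):
    """Closed form: theta = (X^T X)^{-1} X^T y, via solve() rather than an explicit inverse."""
    return np.linalg.solve(X.T @ X, X.T @ y)

# Toy data: y ~ 1 + 2x plus noise; both methods should recover roughly [1, 2].
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
X = np.column_stack([np.ones_like(x), x])
y = 1 + 2 * x + rng.normal(0, 0.05, 100)
print(fit_normal_equation(X, y))
print(fit_gradient_descent(X, y))
```

The normal equation is exact but costs a matrix solve in the number of features; gradient descent trades that for an iteration count and a step size $\alpha$ that must be tuned.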

Logistic Regression

1. Hypothesis

Logistic regression maps the linear function to a probability using the sigmoid function:

$$h_\theta(x) = g(\theta^T x), \qquad g(z) = \frac{1}{1 + e^{-z}}$$
  • Prediction Rule:
    • Predict $y = 1$ if $h_\theta(x) \ge 0.5$.
    • Predict $y = 0$ if $h_\theta(x) < 0.5$.

2. Cost Function

The cost function for logistic regression is:

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\left(h_\theta(x^{(i)})\right) + \left(1 - y^{(i)}\right) \log\left(1 - h_\theta(x^{(i)})\right) \right]$$

3. Optimization

  • Gradient Descent: $\theta_j := \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$
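The update has the same form as linear regression, only with the sigmoid inside the hypothesis. A minimal NumPy sketch under the same assumptions as before (bias column in `X`, labels `y` in {0, 1}; function names are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, alpha=0.1, iters=10_000):
    """Batch gradient descent on the cross-entropy cost J(theta)."""
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(iters):
        h = sigmoid(X @ theta)                # h_theta(x^(i)) for every example
        theta -= alpha * (X.T @ (h - y)) / m  # identical form to the linear case
    return theta

def predict(X, theta):
    """Apply the 0.5 threshold: y = 1 iff h_theta(x) >= 0.5."""
    return (sigmoid(X @ theta) >= 0.5).astype(int)
```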

4. Sigmoid Function Properties

  • $g(z)$ maps inputs to the open interval $(0, 1)$, so its output can be interpreted as a probability.
  • Derivative: $g'(z) = g(z)\left(1 - g(z)\right)$
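The derivative identity is easy to sanity-check numerically with a central finite difference (a quick check, not a proof):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.linspace(-5, 5, 11)
eps = 1e-6
numeric = (sigmoid(z + eps) - sigmoid(z - eps)) / (2 * eps)  # central difference
analytic = sigmoid(z) * (1 - sigmoid(z))                     # g(z) * (1 - g(z))
print(np.max(np.abs(numeric - analytic)))                    # tiny, on the order of 1e-11
```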

Ridge Regression

Ridge regression is a regularized variant of linear regression: it adds an L2 penalty term to the loss function to address multicollinearity and overfitting.

Loss Function

Ridge regression minimizes the following loss function:

$$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2 + \lambda \sum_{j=1}^{n} \theta_j^2$$
  • $\lambda$: the regularization parameter, controlling the strength of regularization.
    • $\lambda = 0$: equivalent to ordinary linear regression.
    • $\lambda \to \infty$: all $\theta_j \to 0$.
  • $\sum_{j=1}^{n} \theta_j^2$: the L2 penalty term, which constrains the size of the parameters to prevent overfitting.

Ridge regression has a closed-form solution, obtained by modifying the normal equation of ordinary linear regression:

$$\theta = (X^T X + \lambda I)^{-1} X^T y$$
  • $X$: the design matrix.
  • $I$: the identity matrix.
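A minimal NumPy sketch of this closed form. Note that, exactly as in the formula above, it penalizes every coefficient including the intercept; libraries such as scikit-learn typically leave the intercept unpenalized:

```python
import numpy as np

def ridge_closed_form(X, y, lam=1.0):
    """theta = (X^T X + lambda * I)^{-1} X^T y."""
    n = X.shape[1]
    A = X.T @ X + lam * np.eye(n)       # lambda * I keeps A invertible and well-conditioned
    return np.linalg.solve(A, X.T @ y)  # solve() is preferred over an explicit inverse
```

Adding $\lambda I$ also guarantees the matrix is invertible even when $X^T X$ is singular, which is precisely how ridge regression copes with multicollinearity.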

Gradient-descent solution: the loss function can also be minimized by gradient descent:

$$\theta_j := \theta_j - \alpha \left( \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} + 2\lambda \theta_j \right)$$
  • The regularization term $2\lambda \theta_j$ is the penalty on $\theta_j$, shrinking it toward zero at every step.
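The corresponding update in NumPy, matching the $2\lambda\theta_j$ term above (a sketch; whether to scale the penalty by $1/m$ is a convention choice that varies between texts):

```python
import numpy as np

def ridge_gradient_descent(X, y, lam=0.1, alpha=0.01, iters=5000):
    """Gradient descent on the ridge loss J(theta) above."""
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(iters):
        grad = (X.T @ (X @ theta - y)) / m + 2 * lam * theta  # data term + 2*lambda*theta
        theta -= alpha * grad
    return theta
```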

Logistic Regression (Detailed Derivation)

$$h_\theta(x) = g(\theta^T x), \qquad g(z) = \frac{1}{1 + e^{-z}}$$

Prediction rule:

  • Predict $y = 1$ if $h_\theta(x) \ge 0.5$, i.e. $\theta^T x \ge 0$.
  • Predict $y = 0$ if $h_\theta(x) < 0.5$, i.e. $\theta^T x < 0$.

The decision boundary is the set of points where $\theta^T x = 0$.
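Because $g(z) \ge 0.5$ exactly when $z \ge 0$, classifying a point only needs the sign of $\theta^T x$; the sigmoid never has to be evaluated. A tiny illustration with hypothetical parameters:

```python
import numpy as np

theta = np.array([-3.0, 1.0, 1.0])   # hypothetical parameters [theta_0, theta_1, theta_2]
X = np.array([[1.0, 1.0, 1.0],       # theta^T x = -1 < 0  -> predict y = 0
              [1.0, 2.0, 2.0]])      # theta^T x = +1 >= 0 -> predict y = 1
print((X @ theta >= 0).astype(int))  # [0 1]; the boundary is the line x1 + x2 = 3
```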

Cost Function and Gradient

The cost function for logistic regression is defined as:

$$\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$$

When $y = 1$:

  • $\mathrm{Cost} = 0$ if $h_\theta(x) = 1$.
  • As $h_\theta(x) \to 0$, $\mathrm{Cost} \to \infty$.
  • When $h_\theta(x) = 0$, the model predicts $P(y = 1 \mid x; \theta) = 0$ even though $y = 1$, so the cost blows up: confident wrong predictions are penalized heavily.

When $y = 0$:

  • $\mathrm{Cost} = 0$ if $h_\theta(x) = 0$.
  • $\mathrm{Cost} \to \infty$ as $h_\theta(x) \to 1$.
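A quick numeric illustration of both cases, using a few hypothetical predicted probabilities:

```python
import numpy as np

for h in (0.99, 0.5, 0.01):               # hypothetical values of h_theta(x)
    print(h, -np.log(h), -np.log(1 - h))  # cost if y = 1, cost if y = 0
# h = 0.99: cost ~0.01 when y = 1, ~4.6 when y = 0
# h = 0.01: the reverse; confident wrong answers cost the most
```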

Simplification of Logistic Regression Cost Function

The overall cost function for logistic regression is defined as:

$$J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \mathrm{Cost}\left(h_\theta(x^{(i)}), y^{(i)}\right)$$

Note that $y$ is always 0 or 1, so exactly one case of the piecewise cost above applies to each example. The two cases can therefore be combined into a single expression:

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\left(h_\theta(x^{(i)})\right) + \left(1 - y^{(i)}\right) \log\left(1 - h_\theta(x^{(i)})\right) \right]$$
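The combination works because one of the two terms always vanishes. A two-line sanity check with a hypothetical prediction $h = 0.7$:

```python
import numpy as np

def compact_cost(h, y):
    """-y*log(h) - (1 - y)*log(1 - h), the single-expression form."""
    return -y * np.log(h) - (1 - y) * np.log(1 - h)

h = 0.7
print(compact_cost(h, 1), -np.log(h))      # equal: the (1 - y) term vanishes
print(compact_cost(h, 0), -np.log(1 - h))  # equal: the y term vanishes
```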

To fit the parameters $\theta$, solve $\min_\theta J(\theta)$.

To make a prediction given a new $x$, output:

$$h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}}$$


Gradient Descent Algorithm:

Repeat:

$$\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta) = \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$$

(Simultaneously update all θj).
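"Simultaneously" matters: every $\frac{\partial}{\partial \theta_j} J(\theta)$ must be evaluated at the current $\theta$ before any component is overwritten. A vectorized sketch gets this right for free, whereas updating $\theta_j$ in place inside a loop over $j$ would not:

```python
import numpy as np

def gd_step(theta, X, y, alpha):
    """One simultaneous gradient-descent step for logistic regression."""
    h = 1.0 / (1.0 + np.exp(-(X @ theta)))
    grad = (X.T @ (h - y)) / X.shape[0]  # full gradient computed before any theta_j changes
    return theta - alpha * grad          # every component updated at once
```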

The gradient takes this simple form because of the sigmoid derivative $g'(z) = g(z)\left(1 - g(z)\right)$: differentiating $J(\theta)$ with $h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}}$, the $g'$ factors cancel, leaving the same update rule as in linear regression.